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Abstract 

The similarity in the thermodynamic properties of two completely different 
theoretical models for the helix-coil transition is examined critically. The first 
model is an all-atomic representation for a poly-alanine chain, while the second 
model is a minimal helix-forming model that contains no system specifics. Key 
characteristics of the helix-coil transition, in particular, the effective critical 
exponents of these two models agree with each other, within a finite-size 
scaling analysis. 
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The importance of understanding the statistical physics of the protein-folding 
problem has been stressed recently [|. For instance, it is now often assumed that 
the energy landscape of a protein resembles a partially rough funnel. Folding occurs 
by a multi-pathway kinetics and the particulars of the folding funnel determine the 
transitions between the different thermodynamic states |l|, [|. This "new view" 
of folding was derived from studies of minimal protein models which capture only a 
few, but probably dominant parameters (chain connectivity, excluded volume, etc.) 
in real proteins. 

An implicit yet fundamentally crucial assumption is that the basic mechanism of 
structural transitions in biological molecules depends solely on gross features of the 
energy function, not on their details, and that a law of corresponding states can be 
used to explain dynamics and structural properties of real proteins from studies of 
related minimal models. This assumption needs to be proven. An even stronger no- 
tion in statistical physics is the universality hypothesis for critical phenomena. The 
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critical exponents are identical for different theoretical models and realistic systems 
belonging to the same universality class. Many theoretical concepts in protein fold- 
ing, such as coil-helix or coil-globular transitions involve phase transition or phase 
transition-like behavior. Thus, one wonders if physical measurements between two 
model systems for the same transition would have any "universal" properties. 

The purpose of this article is to examine these questions for the helix-coil tran- 
sition in homopolymers of amino acids @, [|. Traditionaly, the coil-helix transition 
is described by theories such as the Zimm-Bragg model || in which the homopoly- 
mers are regarded as one dimensional systems with only local interactions; as such 
a true thermodynamic phase transition is impossible. However, recently there have 
been j|, || indications that the coil-helix transition near the transition temperature 
displays phase-transition like behavior. We use here finite-size scaling analysis, a 
common tool in statistical physics, to examine the question of universality of the 
helix-coil transition in two completely different, illuminating models. On one hand, 
we have a detailed, all-atomic representation of a homo poly-alanine chain 0. On 
the other hand, we have a simple coarse-grained model describing the general fea- 
tures of helix- forming polymers [[|. In this article, our interest lies in finding out how 
far the similarity of the two models go. If the two models yield the same key physi- 
cal characteristics, then we at least have one concrete example of the validity of the 
corresponding state principle or universality hypothesis in biopolymer structures. 

Poly-alanine is well-known to have high helix-propensities in proteins, as demon- 
strated both experimentally and theoretically || [7| . It has been well tested and gen- 
erally believed that approximate force fields, such as ECEPP/2[|| as implemented 



in the KONF90 program |ID |, give protein-structure predictions to a surprisingly de- 
gree of faithfulness. As our first model, we have "synthesized" poly-alanine with N 
residues, in which the peptide-bond dihedral angles were fixed at the value 180° for 
simplicity. Since one can avoid the complications of electrostatic and hydrogen-bond 
interactions of side chains with the solvent for alanine (a non-polar amino acid), we 



follow earlier work [0] and neglect explicit solvent molecules in the current study. 

Our second model is a minimalistic view of a helix forming polymer || with- 
out atomic-level specifics. A wormlike chain is used to model the backbone of the 
molecule, while a general directionalized interaction, in terms of a simple square 
well form, is used to capture the essence of hydrogen like bonding. The interaction 
energy between the residue labeled i and j is modeled by, 

Vij(T) = 

where v = e[uj • f ijf + e[u.j • r^} 6 , iij = x (r^-i), is the unit vector between 




monomer i and j, D = 3 /2a is the diameter of a monomer, a = y45/8a is the 
bonding diameter, and a is the bond length while bond angle is fixed at 60°. 

To obtain the thermodynamic properties, we have conducted multicanonical 
Monte Carlo simulations for both models. In the low-temperature region where 
most of the structural changes occur, a typical thermal energy of the order ksT is 
much less than a typical energy barrier that the polymer has to overcome. Hence, 
simple canonical Monte Carlo or molecular dynamics simulations cannot sample 
statistically independent configurations separated by energy barriers within a finite 
amount of available CPU time, and usually give rise to bias statistics. One way 
to overcome this problem is the application of generalized ensemble techniques [11] , 
such as the multicanonical algorithm [[HJ used here, to the protein folding problem, 
as has recently been utilized and reported Jl3 . 



In a multicanonical algorithm [12| conformations with energy E are assigned a 
weight w mu (E) oc l/n(E), n(E) being the density of states. A simulation with this 
weight generates a random walk in the energy space; since a large range of energies 
are sampled, one can use the re-weighting techniques |L4| to calculate thermody- 
namic quantities over a wide range of temperatures by 



dx A(x) w-i(E(x)) 
(A) T = J — r , (2) 



where x stands for configurations and (3 is the inverse temperature. 

In the case of poly-alanine chains, up to N = 30 alanine residues were consid- 
ered. The multicanonical weight factors were determined by the iterative procedure 
described in Refs. and we needed between 4 x 10 5 sweeps (for N = 10) and 
5 x 10 5 sweeps (for N = 30) for estimating the weight factor approximately. All 
thermodynamic quantities were measured from a subsequent production run of M 
Monte Carlo sweeps, where M=4 x 10 5 , 5 x 10 5 , 1 x 10 6 , and 3 x 10 6 sweeps for 
N = 10, 15,20, and 30, respectively. In the minimal model, chain lengths up to 39 
monomers were considered. In this model a single sweep involves a rotation of a 



group of monomers via the pivot algorithm |Tj|. For the weight factors the similar 
number of iterative procedure was used, and for the production run 1 x 10 8 sweeps 
was used in all cases. 

We obtain the temperature dependence of the specific heat, C(T), by calculating 

C(T) = f3 2 ^'ot) ~ (£tot) 2 ^ (3) 

where E tot is the total energy of the system. We also analyze the order parameter q 
which measures the helical content of a polymer conformation and the susceptibility 

x(t) = j^((q 2 ) - (q) 2 ) • (4) 

associated with q. For poly-alanine q is defined as 

q = h H (5) 

where njj is the number of residues (other than the terminal ones) for which the 
dihedral angles (<f),ip) fall in the range (—70 ± 20°, —37 ± 20°). For our worm-like 
chain model the order parameter q is defined as 

N-l 

q= J2 u i- u i+i (6) 

i=2 

In both cases the first and last residues, which can move more freely, are not counted 
in the procedure. 



From a finite-size scaling analysis of the heights and width of specific heat and 
susceptibility we can extract a set of effective critical exponents which characterize 



the helix-coil transition in these two models [16]. For instance, with Cmax defined 



to be the maximum peak in the specific heat, we have 

a 

Cmax (xNdv . (7) 

In a similar way, we find for the scaling of the maximum of the susceptibility 

7_ 

Xmax oc Ndv . (8) 

For both quantities we can also define the temperature gap V — T 2 —Ti (where T\ < 
Tmax < T 2 ) chosen such that C(Ti) = bC M Ax = C(T 2 ), and x(Ti) = bx(T c ) = x{T 2 ) 
where b is a fraction. The temperature gap obeys 

1 

r = T 2 - Ti oc N dv , (9) 



as has been suggested in Ref. JT6[ . The analysis should be insensitive to the actual 
fraction, b, of Cmax (xmax) considered for defining T\ and T 2 which was verified 
from our numerical data fitting of poly-alanine chains. 

The scaling exponents, a, v, and 7, have their usual meaning in critical phe- 
nomena; however, the above scaling relations also hold formally for the case of a 



first-order transition, with effective scaling exponents dv = a = 7 = 1 [0, [IT]. Note 
that d is the dimensionality of the system, and it always appears in the combination 
dv. Without knowing further the effective dimensionality of our systems, we use the 
combination dv as a single parameter in the fit. 

It then becomes straightforward to use the above equation and the values given 
in Table [I] to estimate the critical exponents. We obtain for poly-alanine from the 
scaling of the width of the specific heat 1/dv = 1.02(11) with a goodness of fit 



(Q = 0.9) (see Ref. |18| for the definition of Q), for chains of length iV = 15 to 



iV = 30. Inclusion of N = 10 leads to 1/dv = 0.84(7), but with a less acceptable 



fit (Q = 0.1). Similarly, we find from the scaling of the width of the susceptibility 
1/dv = 0.98(11) (Q = 0.5) for chains of length iV = 15 to iV = 30 and 1/dv = 0.81(7) 
(Q = 0.2) when the shortest chain N = 10 is included in the fit. Hence, we present 
as our final estimate for the correlation exponent of poly-alanine dv = 1.00(9). 
This value is in good agreement with the estimate dv = 0.93(5) obtained from the 
partition function zero analysis in Ref. ||. 

The results for the exponent a give a = 0.89(12) (Q=0.9) when all chains are 
considered, and a = 0.86(10) (Q = 0.9) when the shortest chain is excluded from the 
fit. Analyzing the peak in the susceptibility we find 7 = 1.06(14) (Q = 0.5) for chain 
lengths N = 15 - 30 and 7 = 1.04(11) (Q = 0.5) for chain lengths JV = 10 - 30. We 
summarize our final estimates for the critical exponents in Table 0. The scaling plot 
for the susceptibility is shown in Fig. 1: curves for all lengths of poly-alanine chains 
collapse on each other indicating the validity of finite size scaling of our poly-alanine 
data. 

The same procedure can be applied to analyze the data from the minimal model. 
All calculation has been done with the omission of the shortest chain. Using the 
widths of the specific heat a b = 80% of the peak height we obtain 1/dv = 1.03(7), 
(Q = 0.2). The width of the peak at half maximum is more unreliable in this case as 
the coil-helix transition is complicated by the additional collapsing transition to a 
globular state in the vicinity of the coil-helix transitionfQ. This exponent agrees with 
that calculated from the susceptibility widths, 1/dv = 0.89(9), (Q = 0.3). Hence, 
our final estimate for this critical exponent in our second model is dv = 0.96(8). 
These values are in good agreement with those of the poly-alanine model. 

From the Cmax data in Table [I] and using the above given value for the exponent 
dv we find a = 0.70(16) (Q = 0.3) which is somewhat smaller than that of the 
poly-alanine model. The susceptibility exponent as calculated from the data in 
Table [I] yields a value of 7 = 1-3(2) (Q = 0.5), which agrees with the previous 
estimation within the error bar. The scaling plot for the susceptibility is shown in 
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Fig. 2. While curves corresponding to large polymer sizes collapse into the same 
curve, the N = 13 case shows small disagreement, indicating that the finite size 
scaling are valid only for longer chain lengths in the minimal model. 

Comparing the critical exponents of our two models as summarized in Table ^ 
we see that the estimates for the correlation exponent dv agrees well for the two 
models. Within the error bars, the estimates for the susceptibility exponent 7 also 
agree. The estimates for the specific heat exponent a seem disagree within the 
error ranges. However, in view of the fact that both analyses are based on small 
system size the true error ranges could be actually larger than the ones quoted here. 
Using these rather crude results, we have already demonstrated a striking similarity 
in finite-size scalings of the two model. Therefore, we can convincingly make the 
conjecture that minimal model can be used to represent the structural behavior of 
real helix-forming proteins. 

Our analysis should tell us also whether the helix-coil transition in our models is 
of first or second order. In the former case we would expect dv = a = 7 = 1 which 
seems barely supported by our data due to the rather large error bars associated 
with the estimate of the exponents. We have further explored the nature of the 
transition from another perspective, by considering the change in energy crossing a 
small temperature gap (taken to be within 90% of Cmax) from the original data, 

AE = (E tot (T 2 ) - E tot (Ti))/N (10) 

This value should approach either a finite value or zero as iV -1 goes to zero. A 
finite value would indicate a first order transition while a zero value a second order 
transition. In the case of a first order transitions the intercept would indicate the 
latent heat. Now, the assumption is that this energy change scales linearly as N^ 1 
goes to zero. Figure 3 shows a plot of the data from both the atomic-level and 
minimal models, where nonzero intercepts can be extrapolated at iV -1 =0. Hence, 
our results seem to indicate and finite latent heat and a first-order like helix-coil 
transition. However, we can not exclude the possibility that the true asymptotic 
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limit of \E\ is zero, and some of the results of Ref. point for the case of poly-alanine 
rather towards a second-order transition. Further simulations of larger chains seem 
to be necessary to determine the order of the helix-coil transition without further 
doubts. 

In summary, we conclude that in view of the similarity of the two models ex- 
amined here, a corresponding state principle can be established for the coil-helix 
transition. Examining the finite size scaling analysis allows us to calculate esti- 
mators for critical exponents in the two models which indicate "universality" of 
helix-coil transitions. 
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Figure Captions 



1. Scaling plot for the susceptibility x{T) as a function of temperature T, for 
poly-alanine molecules of chain lengths N = 10, 15, 20 and 30. 

2. Scaling plot of x(T) as a function of temperature T, for the minimum model 
of chain lengths N = 13, 19, 26, 33 and 39. 

3. Scaling of energy gap and transition width at 80% and 90% of Cmax- Here 
we have used AE 80 %(A for all-atom model, O for minimal model), AE g0 %(O 
for all-atom model, Q f° r minimal model). 
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Table 1: Shown are the location of the specific heat maximum Tmax? the maximum 
of specific heat Cuax, susceptibility Xmax, the width of the half peak in specific 
heat T c , and width of the half peak of susceptibility T x for various chain lengths. 



N 


Tmax 


Cmax 


T c 


Xmax 




All-Atomic Model 


10 


427(7) 


8.9(3) 


160(7) 


0.49(2) 


140(7) 


15 


492(5) 


12.3(4) 


119(5) 


0.72(3) 


110(5) 


20 


508(5) 


16.0(8) 


88(5) 


1.08(3) 


78(5) 


30 


518(7) 


22.8(1.2) 


58(4) 


1.50(8) 


56(3) 


Minimal Mode 




13 


1.25(1) 


1.088(2) 


1.22(2) 


0.232(2) 


2.20(2) 


19 


1.17(1) 


1.424(5) 


1.12(2) 


0.353(3) 


0.81(2) 


26 


1.16(1) 


1.789(8) 


0.89(2) 


0.553(8) 


0.57(2) 


33 


1.13(1) 


2.08(1) 


0.73(2) 


0.78(1) 


0.45(2) 


39 


1.12(1) 


2.27(2) 


0.61(2) 


0.96(2) 


0.41(2) 



Table 2: Summary of the critical exponents obtained for the two models. 





All-atomic 


Minimal 


dv 


1.00(9) 


0.96(8) 


a 


0.89(12) 


0.70(16) 


7 


1.06(14) 


1.3(2) 
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